TRAIT SUBSTITUTION TREES ON TWO TIME SCALES ANALYSIS 

ANTON BOVIER AND SHI-DONG WANG 

Abstract. In this paper we consider two continuous-mass population models as ana- 
logues of logistic branching random walks, one is supported on a finite trait space and 
the other one is supported on an infinite trait space. For the first model with nearest- 
neighbor competition and migration, we justify a well-described evolutionary path to 
the short-term equilibrium on a slow migration time scale. For the second one with 
an additional evolutionary mechanism-mutation, a jump process-trait substitution tree 

fvj model is established under a combination of rare mutation and slow migration limits. 

The transition rule of the tree highly depends on the relabeled trait sequence determined 
by the fitness landscape. The novelty of our model is that each trait, which may nearly 

■^^ die out on the migration time scale, has a chance to recover and further to be stabilized 

on the mutation time scale because of a change in the fitness landscape due to a newly 
entering mutant. 
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1. Introduction 

In recent years a spatially structured population with migration (namely mutation in 
[3J) and local regulation proposed by Bolker and Pacala [T], Dieckmann and Law [7J 



Date: April 18, 2013. 

2000 Mathematics Subject Classification. 92D25, 60J85, 37N25, 92D15, 60J75. 

Key words and phrases, continuous mass population, slow migration, rare mutation, trait substitution 
tree, fitness structure. 

A. Bovier is supported by in part by the DFG in SPP 1590 "Probabilistic Structures in Evolution" 
and the Hausdorff Center for Mathematics. S.-D. Wang was supported by a Hausdorff Scholarship while 
at the University of Bonn, and EPSRC Grant EP/I01361X/1 while at the University of Oxford. 

1 



TRAIT SUBSTITUTION TREE 2 

(BPDL process) has attracted particular interest both from biologists and mathemati- 
cians. It has several advantages over the traditional branching processes, which make it 
more natural as a population model: the quadratic competition term is used to prevent 
the population size from escaping to infinity, and the mutation term is used to create 
an alternative trait type of the population for selection. Over the last decade, a lot of 
work has been addressing different aspects on this model. For instance, Etheridge [9], 
Fournier and Meleard [TO], and Hutzenthaler and Wakolbinger [H] study the extinction 
and survival problems. Champagnat [3], Champagnat and Lambert jl], Champagnat and 
Meleard [5], Meleard and Tran [16J, Dawson and Greven [6] mainly focus on its long time 
behavior by multi-scale analysis methods. 

The present work is largely motivated by the derivation of macroscopic phenomena 
on the level of populations from the individual based models in the joint limits of large 
population size and small mutation rates. We mention in particular the work of Fournier 
and Meleard [lOJ . Champagnat [3j where under certain conditions convergence to the so- 
called "trait substitution sequence (TSS)" was obtained. More recently, this type of results 
was extended in Champagnat and Meleard j5] to include further evolutionary phenomena 
such as evolutionary branching. A common feature of these works is the following setup: 
one assumes that mutations rates are so small that a monomorphic population, after a 
single mutation event has sufficient time to move to a new equilibrium where either the 
mutant trait gets extinct or the mutant trait fixated and the resident trait gets extinct. In 
this way, one obtains, on the time scale at which such rare mutations occur, a sequence of 
populations evolving towards increasing fitness, the so-called trait substitution sequence. 
In certain singular situations, one may also reach an equilibrium with co-existing traits, 
leading to the above-mentioned phenomenon of evolutionary branching [5] . 

What we wish to add to this picture in the present paper is a more complex structure 
of populations. The general idea is to consider populations with multiple traits where 
individual may change (upon birth or otherwise) between a finite set of traits at a given 
(population size independent) rate. We will term such switches "migrations". In addition, 
there are rare mutations where an individual can be born with a new trait which has never 
been existing in the population. 

This set-up is motivated from ideas that are currently discussed intensely in cancer 
research. The migration events can be interpreted as epigenetic switch in the gene- 
expression of a cell between a variety of possible "metastable" state (see e.g. Huang [13] 
and Gillies et al [H] and references therein). Mutations are then true mutations that 
lead to a change in the epigenetically accessible trait-space. See also Holzel et al for a 
discussion in the context of cancer evolution [12] . In this paper we consider a very simple 
caricature of such a complex situations. Our purpose here is limited to showing that such 
models are still accessible to the mathematical methods developed in recent years, and 
that such systems give rise to new and interesting mathematical structures. 

In this paper we investigate the long term behavior in a two-step limiting procedure 
where we first let the population size tend to infinity, and then let the migration rate tend 
to zero while rescaling time in an appropriate way to obtain a non-trivial limit. For a finite 
trait space, specific conditions are imposed on the fitness and demographic parameters, 
and a well-described evolutionary path to approach the short-term equilibrium will be 
obtained on an appropriate time scale. The noteworthy feature here is that these equilibria 
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can be polymorphic. We call this process a trait substitution tree (TST) on the finite trait 
space. For any given sequence of traits, the equilibrium configuration is determined by 
their labeled order according to their fitness landscape. 

In a second step, we add random mutations on a longer time scale. This is modeled 
here as the appearance of mass at hitherto unoccupied locations in trait space driven by 
some Poisson process. The effect of the appearance of such new mass is a reshuffling of the 
migration part of the process that ends in a new equilibrium configuration. As this process 
continues, we obtain what we call the trait substitution tree (TST) process on infinite 
state space. The somewhat artificial introduction for mutations in the infinite population 
model is motivated on the basis of a limit of a finite population model with migration and 
mutation rates at distinct time scales. Such a model is studied in a companion paper [2] . 

The remainder of the paper is organized as follows. In Section [2j we briefly describe 
the microscopic model and give some preliminary results. In particular, we recall the law 
of large numbers of the BPDL processes. In Section |3j as e tends to 0, on a finite trait 
space we retrieve a well-defined short-term evolution path to its TST configuration on the 
migration time scale O (in |J. In Section 4l under the rare mutation constraint we obtain 
a jump-type TST process on a longer time scale-the mutation time scale. In Section [5j we 
provide proofs of the results in Section [3] and Section [4} Finally, for better understanding 
the TST process we provide a simulation algorithm in Section [6j 

2. Microscopic model 

2.1. Notation and description of the processes. Following [1], we assume the pop- 
ulation at time t is composed of a finite number I(t) of individuals characterized by their 
phenotypic traits Xi(t), ■ ■ ■ , Xi(t)(t) taking values (which can be equal) in a compact subset 
X of R d . 

We denote by JHf(X) the set of non-negative finite measures on X. Let Ai a (X) C 
Mf{X) be the set of atomic measures on X: 



M a {X) = i J] 8 Xi : xi, • • • , x n G X, n G N 



Then the population process can be represented as: 

i(t) 

Let B(X) denote the totality of bounded and measurable functions on X. Let B(A4f(X)) 
(and B(J\4 a (X))) be totality of bounded and measurable functions on Jii F (X) (and 
M a {X)). For v G M F (X) and G B(X), denote by (u,<f>) = J <f>du. 

Let's specify the population processes (u^) t >o by introducing a sequence of demographic 
parameters, for nG N: 

• b n (x) is the rate of birth from an individual with trait x. 

• d n (x) is the rate of death of an individual with trait x because of "aging". 

• a n (x, y) is the competition kernel felt by some individual with trait x from another 
individual with trait y. 
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D n (x,dy) is the children's dispersion law from its mother with trait x. In partic- 
ular, it can be decomposed into two parts-local birth at location x and a small 
portion of migration based on birth, i.e. 

D n (x, dy) = (1 - e)l x=y + em n (x, dy)\ x+y . (2.1) 

Here, m n (x,dy) is the transition law for migration, which satisfies 



m n (x,dy) = 1. 



yex 



We will omit the superscript e in D n in the sequel when this leads no ambiguity. 
Fournier and Meleard pU] formulated a pathwise construction of the BPDL process 
{{ v t)t>o'i n £ N} in terms of Poisson random measures and justified its infinitesimal 
generator defined for any $ G B(Ai a (X)): 



!%*{*) 



u(dx) / [$0 + S y ) - $(»] b n {x)D n (x, dy) 



x 
+ / u(dx) [$(1/ - S x ) - $(*/)] 



d n (ar) + / a n (x,y)p(dy) 

x 



(2.2) 



The first term is used to model birth events, while the second term which is nonlinear is 
interpreted as natural death and competing death. 

Instead of studying the original BPDL processes defined by (2.2), our goal is to study 
the rescaled processes 



X-:- 



v* 



n 



t> 



(2.3) 



since it provides us a macroscopic approximation when we take the large population 
limits (we will see later, the initial population is proportional to n in some sense). The 
infinitesimal generator of the rescaled BPDL process has the following form, for any 
$ e B(M F (X)): 

/* r a 

nv(dx) \ $(z/ + — ) - $(z/) 



L n $(v) 



x Jl 

+ / nu(dx) 
Jx 



n 



b n (x)D n (x,dy) 



<f>(v 
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X - 

n 



®(iA 



d n (x) + / a n (x,y)nv(dy) 
x 



(2.4) 



2.2. Preliminary results. Let's denote by (A) the following assumptions: 

(Al) There exist b(x), d(x), m(x) G B(X), a(x,y) G B(XxX) withm(x) a probability 
density for x, y G X, n G N, such that 

< b n (x) = b(x), < d n (x) = d(x), m n (x,y) <m(y), 

a(x,y) 



< a n (x,y) 



n 



(A2) b{x) - d{x) > 0. 

The first assumption implies that there exist constants b, d, a such that b(x) < b, d(x) < 
d, a(x,y) < a. Furthermore, it guarantees the existence of the BPDL process (see [TO]). 

By neglecting the high order moment, Bolker and Pacala pQ use the "moment closure" 
procedure to approximate the stochastic population processes. As we can see from the 
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generator formula (2.4), due to the quadratic nonlinear term, it should be enough to set 
the third order moments to be uniformly bounded and "close" the equation up to second 
order moment . Then Fournier and Meleard [10] obtain a deterministic measure-valued 
process in the large population limit. 

Theorem 2.1 (Fournier and Meleard [10], convergence to an integro-differential equa- 
tion). Under the assumption (Al), consider a sequence of processes (X™) t >o defined in 



(2.3). Suppose that (Xq) converges in law to some deterministic finite measure X G 
M.p(X) as n — > oo and satisfies supE(X^,l) 3 < oo. Then the sequence of processes 

n>l 

(X t n )j> converges in law as n — > oo, on D([0, oo), M. F (X)), to a deterministic measure- 
valued process (X t )t>o G C([0, oo), M.p(X)), where (X t ) t > is the unique solution satisfy- 



ing 



sup (X t , 1) < oo, (2.5) 

te[o,T] 



and for any G B(X), 



(X u <f>) =(X O ,0)+ f ds f X s (dx)b(x) I (j)(y)D(x,dy) 
Jo J X 

ds I X s (dx)cj)(x 
! <i Jx 



d(x) + I a(x,y)X s (dy) 
x 



(2.6) 



3. TST ON A FINITE TRAIT SPACE: WITHOUT MUTATION 

The trait substitution sequence (TSS) model is a powerful tool in understanding various 
evolutionary phenomena, such as evolutionary branching which may lead to speciation 
(see Champagnat and Meleard [5]). Moreover, the population follows the "hill climbing" 
process on the increasing fitness landscape, and holds monomorphic trait on a long time 
scale. This model is proposed by Metz et al. JTT] (so called "invasion implies fixation") 
and mathematically studied by Champagnat et al. [3, SJ HE] . 

Notice that the dispersal kernel D(x, dz) implicitly depends on a parameter e (see 



(2.1)). Rather than taking large population and rare migration limits simultaneously as 
in [3], we justify a so-called trait substitution tree (TST) from a macroscopic point of 
view. More precisely, we first consider the large population limit to attain a macroscopic 



approximation of the individual-based model (see Theorem 2.1). Then, we consider the 
slow migration limit by a rescaling procedure based on the macroscopic limit. In contrast 
to the model in Champagnat [3], the migration rate here is not constrained in terms of 
the demographic parameter (population size). 

Here, the so-called TST process arises under the slow migration limit when we assume 
the nearest-neighbor competition. Note that a variety of short-term evolution paths can 
be attained by specifying different competition strengths. In other words, the order of 
invasion and recovery has no special significance even though in this section we restrict 
the picture by forward invasion into the fitter direction and backward recovery into the 
unfit direction along the fitness landscape. However, these paths are indistinguishable 
on a longer scale-the mutation time scale followed by the next section. Nevertheless, 
apart from the interesting tree structure the TST model also brings us some insights into 
speciation phenomena - evolution from a monomorphic ancestor to diverse species. 
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Denote by (C) the following assumptions: 

(CI) Assume X = {x ,Xi,x 2 , ■ ■ ■ ,xl} comprised of distinct traits with index up to 

L G N. Monomorphic initial trait: Xq = — -8 Xo , and — Q — > £(^0) as n — )• oo. 
(C2) Nearest-neighbor competition and migration: a(xi,Xj) = m(xi,Xj) = for \i—j\ > 

1, and 

x -< X\ -< ■ ■ ■ -< X L -x < XL (3.1) 

where Xi-\ -< Xi means /i,j_i > 0, /j_i,j < for any 1 < i < L with fitness function 

fij := b(xi) - d(xi) - a(xi,Xj)€(xj), and £(xj) := ^if'^f ■ 
(C3) For any i > 2, 



> 



1 



b(xi) - d{xi) f it i-i 
(C4) For any i > 0, j /i -' +ll i < 1, and 

\fi,i+l\ 1 



+ 



fi-l,i-'< 



+ ••• + 



1 



> 



/l,0 

|/i+l,i+2| 



(3.2) 



(3.3) 



fi+2,i+i(b(xi) - d(xi)) fi+3,1+2 " f i+ 3,i+2(b(x i+ i) - d(x i+1 )) ' 

Notice that (C3-C4) are just technical assumptions for results in this section but not 
necessary for results in next section. In fact, assumption (C3) guarantees that the pattern 
for fixation of fitter traits is in a form of one-by-one replacements until the fittest trait 
rather than immediate establishments (see proof of Proposition 5.2 and Proposition 5.3). 
(C4) implies that the recovery time of trait Xi is later than that of type Xi+% (see Lemma 



5.4). 



We first consider the macroscopic limit (2.6) which involves the parameter e > 0, and 



rewrite it in another form, for any <p G B(X), 



(A7,0)=(X O , 



f I ds Xg(dx)b(x) / [4>{y) — <j>(x)] m{x, dy) 

'o J X JX 



I ds / X e s (dx)<j){x) 

'0 JX 



b(x) - d(x) - I a(x,y)X € s (dy) 
x 



(3.4) 



Suppose that the process is supported on a finite trait space 

X = {x ,Xi,--- ,x L }, 

and allow only nearest-neighbour competition and migration. The infinite population size 
limit then yields a dynamical system given by 



tt(xi) =£o{xi) + 



b(xi)-d(xi)- 22 a ( x i> x j)£s( x j) 



j=i±l,i 



£, s (xi)ds 



^ \b{x j )^ s (x j )m(x j , xi) - b(xi)£ s (xi)m(xi, Xj)\ ds, 



Ki< L. 



j=i±i 



(3.5) 



Global existence and uniqueness of the processes follows from Theorem 2.1 



In the following theorem, we derive a trait substitution tree model based on the above 
macroscopic approximation by letting e tend to zero while rescaling time. 
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Theorem 3.1. Admit assumptions (A) and (C), consider the deterministic measure- 
valued processes (Xf) t >a specified by ( |3.5 ) on the trait space X = {xq,xi,x 2 , ■ ■ ■ ,xl}, for 



any L E N. Then the sequence of rescaled processes [X* i I converges, as e — > 0, to 

\ *' ln 7/i>0 

{Ut) t>0 which has the following forms depending on the integer L is even or odd. 
(i) When L = 21 for some IgNUO, 



U t 



{(x )S Xo 
€(x L )6 XL 



forO<t< h, 

for I k <t< h+i, k = 1, 

for I L <t<I L + S L - 2 , 



L-l, 



EC(^2i)^ 2i fori 



2j+2 



S 2j < t < I 2j + S 2 j-2, j = I 



,1, 



(3.6) 



i=j 



E Z(X2i)5 X2i 
V. i=0 



where h 



k 

E 



i=i 



Ji,i- 



for t> I 2 + S . 



and Si, 



l/fc,fc+il 



fk+2,k+i(K x k)-d(x k )) 

(ii) When L = 11 + 1 for some IgNUO, 

forO<t< h, 

for I k <t < 4+i, k 



Ut 



{(Xo)o~x 

iMs Xk 

£(xl)$x l 

l+l _ 

E f(»a-i)Wi 

i=j 

l+l _ 

E^( a; 2 i -l)4 
^ i=l 



J X2i-l 



for I L <t<I L + S L _ 2 , 
for I 2j+1 + S 2 j-i < t < I 2j . 

fort>I 3 + S 1 . 



i + 5 : 



2j-3, J 



(3.7) 



Remark 3.2. (1) As time passes on, the limiting process (U t )t>o starts with monomor- 
phic substitutions up to the domination of the fittest trait. Afterwards, the relatively unfit 
traits start to recover along the fitness decreasing direction. From the fittest trait back to 
the initial one every second one appears in the limit. For instance, when X = {xq, x±, x 2 }, 
the stable configuration has support {xo,x 2 }; when X = {xo,x\,x 2 , x%}, the stable config- 
uration has support {x\,xs} (see Figure^. This is because the competition is restricted 
between nearest neighbors, and the trait on the right hand side is always fitter than the 
traits on the left. 

(2) The TST process indexed by L + 2 can be constructed from the TST process indexed 
by L by adding a three-type sub-tree on top of it. For instance, it is shown in the Figure 
[1 that the TST (when L = A) can be constructed from a smaller TST (when L = 2) by 
connecting another excursion consisting of traits {x 2 , x^jX^}. 



We postpone the proof of the above result to Section |5.1| 

4. TST ON AN INFINITE TRAIT SPACE: WITH MUTATION 

In Secti on [3] we analyze a continuous-mass population on a finite trait space defined by 
equation ( |3.5[ ). On the way towards its equilibrium configuration, under some restrictive 
conditions, a deterministic evolutionary picture arises on the slow migration time scale 
0(lni). 
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FIGURE 1. Numerical simulations of evolution of a dynamical system with monomor- 
phic initial type and finite trait space (the upper left one has X — {xq, x±, x^\ while the 
upper right one has X = {x , xi,x 2l x 3 }). Curves describing &(xo), Ztfai), 6(^2), ftfas) 
are colored black, blue, red, green, resp.. The equilibrium configuration for the first 
case is S Xo + 3S X2 and is 2S Xl + A8 X3 for the second one. The lower panel gives their 
corresponding "trait substitution tree" structure. 
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FIGURE 2. Trait substitution tree constructed by embedding excursions. 
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In order to generalize the process to infinite trait space, we introduce another evolution- 
ary mechanism, mutation of a trait x with a transition kernel p(x, dh) for mutant variation 
such that i + fteA". Notice that the essential difference between mutation and migration 
is that mutation creates some new trait, while migration is only allowed among the ex- 
isting traits. More precisely, we specify a new model {X^' a ,t > 0} on D([0, oo), M. F [X)) 
with the following infinitesimal generator, for any e, a > and proper test functions F 
and 0, 



L £ '°F(v) 



x 

+ e 



b(x) — d{x) — I a(x,y)v(dy) 
Jx 

SF{w,x] 



6F{v;x) 

6^~ 



v(dx) 



A 



x 



Su 



v(dx) 



(4.1) 



+ a / [F(u + pS x+h ) -F(i/)]p(x)p(x,dh)v(dx) 

JX JR d 

where the derivative of F is defined by 



6F(v; x) 
~8v 



lim 

e-s>0+ 



F(v + e5 x )-F{y) 



and the operator A coincides with the migration term in (3.5) 

A(f>(x) =■ J [<f>(y) - (f>{x)] l {2/6sU pp {i , } }m(:r, dy). 



(4.2) 



(4.3) 



x 



The first term of the generator describes the local regulation of population dynamics. The 
second term describes migration among supporting trait sites. Note that migration is not 



restricted to birth events any more as in (3.5), which is reasonable if we interpret them 



as a changes in gene expression. The last term creates a new mutant trait to the current 
population. The mutant mass is specified by a magnitude of p > 0, which can be taken 
to zero in a final step. The non-negative function p(x) describes the mutation rate of the 
resident trait x. The parameters e and a are used to rescale the strength of migration 
and mutation of the population. For any fixed e, o > 0, the process {X f e,CT ,t > 0} can 
be obtained as a large population limit (as n — > oo) of the processes specified by the 
generator 



r F{v) 



x 



X 



+ e 



F(v 


•+-)-F(v) 

n 


b{x)nv{dx) 


F{v- 5 -)-F{v) 
n 


1 d{x) + / a(x,y)u(dy) J nis(dx) 


L 


F (v + JL - — 
n n 


-)-F(iz) 


^-{ye S upp{u}}m(x, dy)nv[dx 



(4.4) 



+ a / / [F(u + pS x+h ) - F(v)\p{x)p{x,dh)v{dx). 

J X JR d 

For more discussion on discontinuous superprocesses with a general branching mecha- 
nism, one can refer to [Tjj]. We will not expand the discussion here. 

The following assumptions (D) ensure that the limiting TST process is well-defined. 
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(Dl) For any given set of distinct traits {xo,x±,--- ,x n } C X,n <E N, there exists a 
total order permutation 

where x -< y means that the fitness functions satisfy f(x,y) := 6(x) — d(x) — 
a(x,y){;(y) < 0, and /(y,x) := 6(y) - d(y) - a(y,x)£(x) > 0. 

For simplicity we always assume x -< x™ -<•■■-< Xn with x^ = x ni , 
1 < i < n. Every time a new trait x appears whose fitness is between x™ and 
x^ for some < j < n, we relabel the traits as follows 

4 n+1) ^4 n+1) ^---^ +1) ^ n + + i 1) , (4-6) 

where Xj = x\ for < i < j, x™ +1 = x and x\ n = x^ for j+2 < i < n+1. 
(D2) Competition and migration only occurs between nearest neighbors, i.e., for totally 
ordered traits in (Dl), we have m(xf l \xj) = a{x\ ■ , Xj ) = for | i — j |> 1. 

Notice that assumptions (C3-C4) provide a convenient setting for which the evolution- 
ary path on the migration time scale can be easily identified. More complex situations 
can, however, be analyzed in a similar way and lead to qualitatively similar results. 

We now give a description of the limiting process on the mutation time-scale. 

Definition 4.1. A M.p(X) -valued Markov jump process {T t : t > 0} characterized as 
follows is called a trait substitution tree with the ancestor T = £,{xq)5 Xo . 

(i) For any non-negative integer I, it jumps from T^ := Y^i=o^( x 2i )^ ( 2i ) to T^ 21 ^ 

with transition rate £(x 2k )fi(x 2k )p(x 2k ,dh) for any < k < I, where 

- rw = El =1 fcSSi)^ + £(4? + h)6 x n +h + EUi+i ^(4? V 4f 

if there exists < j < I s.t. x 2 j -< x 2k + h -< x 2 ^ +1 , 

(21) n x . v^i W (21) S. 

'■■ ■ • '■ "• (2i) 



_ r(^D = n=ie>Sfii)^ i + EUe(4fV 4 - 

i/ t/iere eaxste < j < / s.t. a^j-i "^ ^k + ^ "^ x 2j • 
Then, we relabel the trait sequence according to the total order relation as in (Dl): 

(21+1) (21+1) (21+1) (21+1) U ?] 

where in associate with the first case 

x? +1) := af> /or < < < 2j, x§+? := sg? + *, 

x f /+1) ;= x J 2/ j for2j + 2<i<2l + l, 
and in associate with the second case 

xf l+1) := xf l) for0< l < 2j - 1, x% +1) := x™ + h, 
x f l+1 ^ ■= xf}\ for2j + l<i<2l + l. 
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(ii) For non-negative integer I, it jumps from T^ 21 ^ := ^2i=i £i x 2i-i )^ ( 2 ' +1 ) to T^ 21 ^ 

X 2i-1 

with transition rate ^,{x 2k _ 1 )/J'(x 2k _ 1 )P( x 2k-i > ^0 f or an y I < k <l + l, where 



if there exists 1 < j < I + 1 s.i x 2 j-i ~^ x 2k-i + h -< x 2 - , 

if there exists 1 < j < I + 1 s.t a4j-2 "^ x 2fc-i + h -< x 2 ^_ 1 . 
Then, we relabel the trait sequence according to the total order relation as in (Dl): 

{21+2) (21+2) (21+2) (21+2) ( . ^ 

x Q -<; x 1 -<•••-< x 2l+1 -< # 2 z+2 , l 4 -°J 

where in the first case 

xf +2) := xf +l) /or < * < 2j - 1, x% +2) := x$*? + h, 

x\ 2l+2) ■= x£ +1) for 2j + l<i<2l + 2, 
and in the second case 



x (2l+2) ;= x (2l+l) /or < , < 2j _ ^ x i2l+2) ;= x (W) + K 

x f l+2) ■= x^+i) y or 2j<i<2l + 2. 

Remark 4.2. (see FigureVw. According to the definition, the new configuration is con- 
structed in a way that every second trait gets stabilized when one "looks down" from the 
fittest trait along the fitness landscape. Once a mutant is inserted between two trait levels 
(say, i andi + 1), we relabel all the traits above the mutant's level. However, the mutation 
only alters the configuration below the i + lth level and not above. To some extent, the 
construction here is similar to the look-down idea of Coalescent processes (see [S]J. 

Theorem 4.3. Admit assumption (A) and (D), and consider processes {X^ ,cr ,t > 0} 



described by the generator (4.1). Suppose that Xq* 7 = £, e (xo)o~ Xo and£ e (xo) — > £(xo) in law 



and p — )■ ; as e — > 0. If it holds that 

then (X € l' 7 )t>o converges, as e — > 0, to the trait substitution tree (r i ) i > given in Definition 



->ln-, (4.9) 

a e 



4-1 Convergence is in the sense of finite dimensional distribution. 



We postpone the proof of the above result in Section |5.2 

5. Outline of proofs 



5.1. Proof of Theorem 3.1 In this section we present the proofs of the results of 
Section [3j The main idea behind the proofs is that the migration spreads linearly and the 
nearest neighbor competitive growth spreads exponentially fast. Before proving Theorem 
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/ 4 3) (=4 4) ) 


— 


^ 4 3) (=4 4) ) 


s/ 





\ 4 4) +m=4 5) ) 



o 

\_ 

o 



4 4) (=4 5) ) 



o- 



4 4) (=4 5) ) 



x 2 \~ x 2 I 



*«M»>) 



x o \— x o ) 



r(3) 



r< 4 ) 



r(s) 



FIGURE 3. A partial path (r( 3 ) -> I^ 4 ' -> r^ 5 )) of the supporting set of the Trait 
Substitution Tree process defined by Definition [4T] Solid lines with "diamond" get fixed 
in the TST process, dotted lines denote virtual traits which are temporarily lost, while 
dashed lines denote new mutant traits. Arrows denote creation of new mutant types. 
Starting with state I^ 3 ', the first mutant arises from trait x\ , and new trait fits into 



„(3) 



,(3) 



,( 3 ) 



< x\ + h\ < x\ . According to Definition 4.1 (ii), traits :Eq , x\' + h%, x\' get 



,(3) „.(3) 



,(3) 



fixed. Consecutively, based on state configuration r^ 4 -*, the second muta nt a rises from 

' 4 ' onJ r.0,1, + r-oi + fi + o IritA A-> _-< „A 4 ) 



trait x^' , and new trait fits into x 3 
x\ , Xq , X4 get fixed. 



-< x 



,(4) 



hi -< x\ . By Definition 4.1 (i), traits 



3.1 we state some preliminary results which are key ingredients for the proof of Theorem 



The following lemma ensures the non-coexistence condition for a dimorphic Lotka- 
Volterra system. We give the proof in the appendix. 

Lemma 5.1. Consider a dimorphic system 

it{xi) = (b{xi) - d(xi) - a(xi,Xi)£ t (xi) - Oi{x h x i+ i)£ t (x i+1 U & (a;*) 

it{x i+ i) = (b(x i+1 ) -d(x i+ i) - a(x i+ i, Xi)£t(xi) - a(x i+ i,x i+1 )£ t (xi + i) )&(zi+i)> 



(5.1) 

with some positive initial condition. If fij+i < 0, /i+i,i > 0, then (0,£(xj + i)) is the only 
stable equilibrium of (5.1). 



The following two propositions are used to prove Theorem 3.1 



Proposition 5.2. Under the assumptions of Theorem 3.1, for the case when L = 2 (i.e. 
X = {xo,xi,x 2 }), the limit process (U t )t>o has the form 

' {(x )5 X0 for0<t< I h 

£{xi)6 xi for h<t< I 2 , 

£{x 2 )S X2 for I 2 <t<I 2 + S , 



U f 



(5.2) 



£,(x )S Xo + $,(x 2 )5 X2 for t> I 2 + S , 



where I\ 



1 — ; I 2 



1 

/l,0 



-?—, and So 

/2,1 ' 



l/o,il 



hAK x o)-d(x )) ' 
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Proof, (a) First, suppose that the population consists of only two types, X 
We divide the entire invasion period into four steps, as shown in Figure |4| 



13 
{x ,Xi}. 




time 



Pi In? 



FIGURE 4. Four-step invasion analysis for a dimorphic system 



(5.3) 



(5.4) 



Let £j(xo) := (XtA{x }} and £$(xi) := (Xf, l{ Xl }}- From (3.5) one obtains 

it( x o) =(b(x ) - d(x ) - a(x , xo)&(xo) ~ a^o^OCt^i))^^) 

- e£l(xo)b(x )m(x , xi) + eil(x 1 )b(x 1 )m(x 1 , x ), 

and 

it( x i) =(K x i) -dfa) -a(x 1 ,x )Q(x ) - a(x 1 ,x 1 )^(x 1 ))^(xi) 

- eQ(xi)b(xi)m(xi, x ) + e^(x )b(xo)m(xo, xi), 

where £0(^0) = £{%o) and §j(a;i) = 0. 

Step 1. For any fixed 77 > 0, V0 < e < 77, let T 6 ' 1 be the time when (^Q(xo),Ct( x i)) leaves 

the e- neighborhood of (£(a?o),0), i.e. 

T^ 1 = inf {t > : £(an) > e, or £(x ) < £(x ) - e}. 
From ( |5.4 ) it follows that , for t < T 6 ' 1 , ^(a?i) satisfies the following differential inequality: 
£t( x i) >(b(xi) — d(xi) — a(xi,Xo)^(xo) — ea(xi,xi) — eb(xi)m(xi,Xo))^l(xi) 

+ e(^(x ) - e)b(x )m(x , xi) (5.5) 

= (/i,o - e(a(xi,xi) + 6(xi)m(xi,a;o)))^ 6 (xi) + e(£(x ) - e)6(s )m(x ,xi). 

Since /i >0 = &(xi) — d(xi) — o>(xi,Xq)£,(xo) > 0, we can choose e sufficiently small so that 
the first term on the right hand side of the above inequality is positive. Omitting this 
positive term, one sees that £t{xi) < ^(^i)? where £0(2:1) = 0, and 

It(si) = e(^(x ) - e)b(x )m(x , a?i). (5.6) 

Thus, T e ' 1 can be bounded from above by T 6 ' 1 = ((£(xo) — e)&(a^o) m (a ; o,3 ; i)) , which i 
the time when £t(xi) reaches the level e-level. Thus, T e>1 is of order 0(1). 



is 



TRAIT SUBSTITUTION TREE 14 

Step 2. After time T e ' 1 , we consider the evolution of the population (^Q(xo),Q(xi)) 
until the time (denoted by T v ' 1 ) when it leaves the ^-neighborhood of (£(xo),0). From 



(5.4), omitting the term e^(xo)b(xo)m(xo,xi), we get 



6(*i) 

> (b(xi) - d{xi) - a(x x , x )Q(x ) - a(a?i, a7i)^(a?i))^ 6 (a;i) - eC t {xi)b(x 1 )m(x 1 , x ) 

> (6(ari) - d(xi) - a(xi, x )^(x ) - r/a(xi, Zi))£ t e (xi) - ^(xi)6(xi)m(xi, x ) 

= (fi,o-vC)C t (x 1 ), 

(5.7) 
where C = a(xi,xi) + b(xi)m(xi,Xo)- On the other hand, by omitting some negative 



terms in (5.4), we get 



tt(xi) < (6(xi) -d(xi) - a(x u x )Q(x ))C t (xi) + e^ t (x )b(xo)m(x 0l x 1 ) 

< (b(xi) -d(xi) -a(xi,x )(^(x ) -^))Ct(^0 + e^(x )6(x )m(x ,xi) (5.8) 

<(/i,oH-77C0£(zi), 

where (7 = a(xi,xo) + £{xo)b(xo)m(xo,Xi). 

Applying Gronwall's inequality to (5.7) and ( |5.8 ), the density Q(xi), starting with 



^ E ,i(xi) = e, can be bounded from below by £t(xi) and from above by £t(xi), which 
satisfy the equations 



and 



lt(xi) = (/i,o-C'7 ? )| t (xi), (5.9) 



lt(xi) = (/ 1 ,o + C'7 ? )|t(xi), (5.10) 



with initial conditions £ T e,i(o:i) = ^r^O^i) — e 5 respectively. 

The times needed for £t(#i) an d £t(#i) to reach the 77- level can be computed explicitly. 
They are given by f"?- 1 - T 6 ' 1 = (/ 1(0 - C77)- 1 In a and f^ 1 - T 6 - 1 = (f lfi + C77)- 1 In 2, 
respectively. Since T 11 ' 1 < T 77,1 < J 1 * 7 ' 1 , for any 77 > 0, T* 7 ' 1 — T^ 1 is of order /{" In -. 



Step 3. From (3.5), the process (£|(a;o),£|(a;i)) starting from time T' 7 ' 1 converges, as 



e — )• 0, to the solution of the following system: 

it{x ) = (b(x ) - d(x ) - a(xi, x )£ t (x ) - a{x u &i)&(a;i))£t(a?o) ,- ^ 

itM = (b{xi) -d{xi) -«(xi,x )6(a;o) - a(x 1 ,x 1 )£ t (x 1 ))£ t (xi), 

which has a nontrivial initial value £pj,i (a?i) = 77, and ^,1 (x ) G (£(#0) — ^ £(^0) + ^) • By 



Lemma 5.1, this dimorphic system has a unique stable equilibrium (0, £(#1)) under the 
assumption that /i : o > 0, /o,i < 0. Let T^ 1 be the time when (£t( x o),£t( x i)) enters the 
77-neighborhood of the equilibrium (0, £(#i)), i.e. £^,1(^0) = V- Since 77 is a given fixed 
constant, T" 1,1 — T 71,1 is of order 0(1) as e — > 0. 
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Step 4. After time T 11,1 , we consider the time needed for x\ to get fixated (i.e. for xq 
gets absorbed at 0). From (5.3), one obtains the differential lower bound: 

tt(x ) 

> (6(a? ) - d(x ) - a(x , x )€$(x ) - a(x , xi)^(xi))^(x ) - e^(x )b(x )m(x , x x ) 

> (b(x ) -d(x ) -r)a(x ,x ) - a(x ,xi)^(xi))^(a;o) - rjb(x )m(x , xi)^(xq) 

= (fo,i-v6)${x ), 

(5.12) 
where C = a(x , x ) + b(x )m(x , x\). As for the upper bound, we observe that 

it{xo) < (b(x ) -d(x ) -a(a;o,a;i)^(xi))^(xo) + eCt{xi)b{xi)m{xi,x ) 

< (b(x ) -d(x ) -a(a; ,xi)(f(a;i) -rj))${x ) + e£(xi)b(xi)m(xi,Xo) (5.13) 

<(fi,o + vC)Ct(xo), 

where C = a(x , xi) + £(xi)b(xi)m(xi, x ). 

Applying again Gronwall's inequality to (5.12) and (5.13), we see that ^(x ), starting 
with £1 1 (xo) = r), can be bounded from below by £t(xo) and from above by £t{xo), which 
satisfy the equations 

kxo) = (fo,i-Cr])£ t (x ), (5.14) 

and 

Uxo) = (fai + Cv)Uxo), (5.15) 

With ifr,,i{x ) = £,fr,,i(xo) = V- 

Since /o,i = b(xo) — d(xo) — a(xo,Xx)^(xi) < 0, we can choose r\ small enough so that 
/o,i + Cr] < 0. Therefore, both £t(#o) an d £t(^o) decay exponentially. For any p\ > 0, the 
process £,t(xo), in time of order piln -, reaches the e _Pl ^°' 1+r,c '^-neighborhood of 0, while 
it(x ) reaches the e'^^^-neighborhood of 0. Let T e >° := f^ 1 + p ± In \ . Then, 

lim&,o(*o) = lim^Cxo) exp ((/ 0|1 + Crj)(T^ - T^ 1 )) 

= lim^exp((/ , 1 + L7r / ). Pl log^) ^ 

= ]ime~ pi/o ' 1 -Oiri) 
= 0. 

Similarly, we obtain lim£y e]0 (x ) = 0. Therefore, lirn£y e}0 (a;o) = 0. Therefore the subpop- 

ulation at X\ eventually gets fixated as e — > 0. 

Combining the four steps above, one concludes that the right time scale for the fitter 
population X\ to get fixated is 

(f^+ Pl )\n- e . (5.17) 

(b) (Recovery process: see Figure [5]) Next, we consider the case when there are three 
distinct trait types, X = {xq,Xx,X2}- At the same time as the population on the site xq 
migrates towards the new site X\ (as shown in (a)), the population of trait x\ can migrate 
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to the site x%. Let Q(x 2 ) '■— (X£,l{ X2 y). In the following, we re-analyze the evolution 
process after adding one more trait x 2 to the previous case with trait space {xo,£i}. Due 
to an e-fraction of initial migration from the subpopulation x to X\ it follows that a e 2 - 
fraction migrates from subpopulation x\ to x 2 . We have ^,i(x 2 ) = e^ £il (xi) = e 2 t;(x ). 
Since the population growth of trait x 2 is in an exponential rate b(x 2 ) — d(x 2 ), the time 
needed for ^(x 2 ), starting with mass of order e 2 to reach the given 77-level, is of order 
lni. 



b(x2)-d(x2) 



As shown in Figure pi because of assumption (C3) we have that 



> 



and 



b(x 2 )-d(x2) /i,o' 

the influence of the population at x 2 is negligible before the time T V)1 when the population 
at x\ reaches the level 77. Since T* 7 ' 1 — T 77,1 = 0(1), the population at x 2 , starting with 
6k, 1(^2) = e • 0(1), evolves under the competition from its resident population x\ as 
follows 

t t [x 2 ) ={b(x 2 ) -d(x 2 ) -a(x 2l x 1 )^{xi) - a(x 2 ,x 2 )Q(x 2 ))C t (x 2 ) 

- e^(x 2 )b(x 2 )m(x 2 , x\) + e^(x 1 )b(x 1 )m(x 1 , x 2 ), 

where ^ e ~ x {x-\) G (£(xi) — r],t;(xi) + ri). On the other hand, until time T 77 ' 1 the populations 
at x and X\ still behave the same as in Step 1-Step 4. Thus, we embed Figure [4] into 
Figure [5] and continue the proof based on the four-step analysis in (a). 

density t 



(5.1* 



£(*) 



n 




















































1 




1 




1 1 








1 




1 





J"M jiri, 



recovery of x 



time 



FIGURE 5. Three-type density evolution (€t( x o),€t( x i)'€t( x 2)) 

Let T* 7 ' 2 be the first time when £t(x 2 ) enters above the level 77. By similar arguments 
as used in Step 2, one can control ^\{x 2 ) by two other curves £t(x 2 ) < €t( x 2) < £,t{x 2 ) 
described as follows, for T* 7 ' 1 < t < T v ' 2 , 



and 



<T v, 
l(x 2 ) = (f 2,1 -Cv)&X2), 



(5.19) 



k*2) = {f 2,1 + Cr])£ t (x 2 ), (5.20) 

where the constants C, C change from line to line and £,f n ,i(x 2 ) = £,f nA (x 2 ) = e ■ 0(1). It 
follows that 

— ^- In - < T 77 ' 2 - f "- 1 < -^ln-. (5.21) 

/2,1 + Or; e h,i-Cri e 
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On other other hand, due to the comparison assumption (C3): h , ^ d(x \ > j— + j—, we 
conclude that the shorter time length among the two for £f (#2) to reach 77-level satisfies 

T^ + ^- -S] In- < T"' 2 < (-L + -L + s)]n-. (5.22) 

1 1,0 J2,i J e \/i,o J2,i / e 

During time interval [T 77 ' 1 , T' 7 ' 2 ], consider the population Q(xq). We inherit the estimate 

£t(xo) < £t( x o) < 6( x o) from Step 4. Subpopulations £t(xo) and £t(^o) ar e described by 
solutions of two equations (5.14) and (5.15), which imply that 

it{x ) = $rn.i{xa)e<*>.i-*<i)(*-?* 1 ), (5.23) 

and 

lt(xo)=fe,.i(xo)e^^<*-^ (5.24) 

with |r„,i(xo) = iWifco) = ?7- 



Combining above with (5.21), one obtains 

fo,i-Cr, fa,i+&r, 

ve f2,i-o, < ^ 2 ( Xo ) < ^, a ( Xo ) < |^, 2 (xo) < Tie ^+<Y (5.25) 

Taking e- migration from its neighbor site X\ into account, the mass on xq is of order 

1-fo.il ,, . l/o.il I/q,iI 

e /2 >! V e. Due to assumption (C4) : j^ < 1, one obtains that e ^ V e = e ^ . 

Short after T''' 2 , as e tends to 0, there follows an immediate swap between Q{xi) and 

£,l(x2) approximated by a Lotka-Volterra system as in Step 3. Denote by T v ' 2 the first 

time when (Q(xi),Q(x 2 )) enters the //-neighborhood of the equilibrium (0, £(3:2)), i.e. 

£~ 2 (a?i) = 77. Also, T v ' 2 — T 71,2 is of order 0(1). Thus, one obtains 

fo.l-Cy fo,i + Cv 

e a,i-o, . 0{j] ) < £i ;2 ( Xo ) = ^, 2 (x ) ■ 0(1) < e ^+ c " ■ 0(77). (5.26) 

Let T v, ° denote the first time after time T* 7 ' 2 when ^(xq) reaches rj- level. For T v ' 2 < t < 
T v, °, £t( x o) is governed approximately by a logistic equation 

it(x ) = (b(x ) - d(x ) - a(x , x )€t(x ))£t(x ). (5.27) 

Then, we have the differential inequality 

(b(x ) - d(x Q ) - a(x ,x )r))£,t(xo) < £t(xo) < (b{x ) - d(x ))£, t (xo), (5.28) 



where ^,2(^0) satisfies (5.26). Then, by Gronwall's inequality, one obtains 
/o,i +&V ,_ 1 . ^,0 - 



In - < T v ' u - T^ 

(f2,i + C V )(b(xo)-d(x )) e 

^ /o,i ~ Cr, 1 

< z - In-. 

C/2,1 - Cr)){b{Xo) - d(x ) - a(x , x )r]) e 

(5.29) 
After T v, °, £f(xo) approaches £(xo) in time length of order 1. 
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lim , 
v->o In - 



Combining the analysis on T v ' 1 in Step 2, and the estimates in (5.21) and (5.29), since 
r] > is arbitrary, we obtain that 

T~ =: Jl ' 

7 — = : h - h, 

J 2,1 

—fo,i 



lim 



lim 

77—5-0 



lni 

e 



(5.30) 



5n 



ln^ / 2)1 (fo(xo)-d(x )) 

Therefore, X 4 e , rescaled on a time scale of order In-, converges to the TST process £/, 



(L = 2) with the form (5.2). 



t 

D 



Proposition 5.3. Admit the same conditions as in Theorem \3.1 Consider the case when 
L = 3, i.e. X = {xq,xi,x 2 ,x 3 }. Then the limit process (U t )t>o has the form 



U t = < 



i 

/l.O 



1 

/2,1 



{(xi)5 Xl 

{(X2)Sx2 

i{x 3 )s X3 

£(zi)4i + £{x 3 )5 x . A 
-, and Si - ^ 



for < t < I u 
for h<t< I 2} 
for I 2 <t< I 3 , 

forI 3 <t<I 3 + S ll 
for t>I 3 + S 1 



(5.31) 



where 1 3 /iq , ^ , fy ^, „,„„ ^ - /32(6(:!;i) _ d(xi)) 

Proof. (See Figure |6J) Due to an e-fraction of migration from a subpopulation to its neigh- 
bor subpopulation it follows that ^,i(x 3 ) = e£^ e ,i(x 2 ) = e 2 ^ eA (xi) = e 3 ^(x ). Since the 
density of trait type x 3 increases in an exponential speed b(x 3 ) — d(x 3 ), the time needed to 



In -. By assumption (C3): 



> 



/2,1 



reach a level 77- level is of order r? — A-., -•■ ■ •-.*■■■ • < u,n,u.„, , v . ., ,. „ . .. 

' b{X3)-d(X3) e J r \ / b(X3)—d(X3) /i,o 

it implies that the population on trait site x 3 is negligible, i.e. of order e, before the time 
T v ' 2 . Thus until that time the evolution of the populations at the other sites preoceeds 
as if this site did not exist. The analysis and notations such as T 71 ' 1 ,^' 1 , T V,2 ,T V ' 2 then 



carry over from the proof of Proposition 5.2 

The evolution of £| (x 3 ) after time T 77 ' 2 is governed by the equation 

itM ={K X 3) - d(x 3 ) - a(x 3 , x 2 )Ct(x 2 ) - a(x 3 , x 3 )Ct(x 3 ))Ct(x 3 ) 
- e&(x 3 )b(x 3 )m(x 3 , x 2 ) + e^(x 2 )b(x 2 )m(x 2 ,x 3 ), 

with initial value i~, r)2 {x 3 ) = e ■ 0(1). 

Let T' n '° and T v ' 3 be the first time (resp.) for ^(xq) and &(x 3 ) to reach 77-level after 



(5.32) 



T v ' 2 . Similarly as in the derivation of Eq. (5.21), we get 

1 



rpr),i T~"7,' 



, In- 

J3,2 e 



(5.33) 



where /(e) 



g(e) means lim /(e) I g(e) 

e->0 



Recall from (5.29) that 



t-u7,o r p'n-,'- 



7< 



0.1 



f2,i(b(x ) -d(x )) 



In 



(5.34) 
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FIGURE 6. Four-type density evolution {€t(xo),€f(xi),t${xn),€f(xa)) 



From assumption (C4), one obtains that 

-/o,i 1 



> 



-/: 



1.2 



f2,i(b(x ) - d(x )) / 3 , 2 h,2{b{x x ) - d{x x )) 



>0, 



(5.35) 



which implies that T 71,3 < T v '°. Hence, for t G [T 71 ' 2 ^ 71,3 ], the population at site Xq, ^(xq), 
stays in some small e-dependent neighborhood of 0. Furthermore, £|(xi) * s influenced 
mainly from the competition with £(£2). Using comparison arguments as above, one 
derives that at time T 71 ' 3 

SUsi) = £M*i) e/l ' 2(T ^' 2) 

~ r]e fl ' 2f ^ ln « 

/l,2 



(5.36) 



r/e 



/3,2 



Similarly as in Step 3, after the time T 71 ' 3 , 0(x 2 ) and £ t e (x 3 ) swap their mass in a time of 
order 1, and £4(22) decreases below a level r] at time T v ' 3 . After time T 71,3 , Q(xi) evolves 
approximately as a logistic growth curve since there is only negligible competition from 
neighbors xq,X2, i.e. 

4 e (x0 = (&(*!) - d( Xl ) - a( Xl , xjCtix^Ctixi), (5.37) 

/l,2 



with initial value £~ 3 (^i) ~ V e 



Ti,' 



ha 



Denote by T v ' ' the second time for the population at site x\ to increase to the level 



r\. Because of the exponential growth property, as in the derivation of (5.34), one obtains 

-/l,2 



J"? 



1.1 



J^Vv 



f3,2{b{xi) -d(xi)) 



111 



1 



(5.38) 



Comparing the times computed above with the estimate (5.35) on T v '°, one observes 



that T 77 ' 1,1 < T v, °. This means that ^(xi) recovers to level r] faster than <^(xo) did. 
Consequently, £| (xq) will be pushed to due to competition from the fitter type x\. 
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Combining (|5.33|), (|5.38|) and the first two equations in (5.30), we obtain the claimed 

□ 



form of the TST limiting configuration (U t ) for L = 3. 

Lemma 5.4. Assumption (C4) implies the following inequalities, for any 4 < L 6 N 



•/. 



o.i 



> 



f2,i(b(x Q ) - d(x )) / 3 ,2 

~fl,2 1 



> 



f3,2(K x l) ~ d(xi)) / 4 ,3 



+ ••• + 



fh,L-\ 
1 



/l, 



L-l 



(5.39) 



./. 



L-3.L-2 



> 



1 



fL-l,L-2(b(x L - 3 ) - d(xL-2,)) " h,L-l 



and 



-f. 



L-4,L-3 



1 



1 



> 



~JL-2,L-1 



fL-2,L-3(b(x L _ A )-d(x L _ A )) f L -l,L-2 /&.L-1 fL,L-l(b(x L - 2 ) ~ d{x L - 2 )) 



(5.40) 



-Jo, 



I 1 > -/2,3 



/VC^o) - d(x )) / 3 , 2 /4,s fwiKz*) ~ d (x 2 )) 

and so on. 

The proof of this Lemma follows iterations straightforward from assumption (C4). On 
the one hand, from (5.39), it implies that when it passes to the limit process U t , all 



processes except Q(xl) stay in e-dependent infinitesimal neighborhoods of at time T v ' 
which denotes the establishing time for type Xl- It leads to monomorphic transportation 
of the mass from the initial trait Xq to the fittest trait xl in the first half period. On the 



other hand, from (5.40), it guarantees that the fitter one recovers earlier than the unfit 



traits alternatively backwards to the most unfit one. 



Proof of Theorem \3.1\ After justifying the form of (U t )t>o for L = 2 in Proposition 5.2 
and L = 3 in Proposition |5.3[ we proceed the proof along two lines, according to whether 



L is an even or odd integer. We here only prove cases along the line when L is a even 
integer. 

We now deduce the expression for (Ut)t>o when L = 4 based on the result for L = 2 
(see Remark 3.2 (2) and Figure [2j. 

after time T 7 '' 3 , we introduce T 77 ' 4 



5.3 



Based on the analysis in the proof of Proposition 
which is defined as the first time for £((24) to reach the ^-level. Similarly as before, we 
can show that 

T^-f 77 ' 3 - — In-. (5.41) 

74,3 e 



Then, after time T v ' 4 , to mimic Step 3 in the proof of Proposition 5.2, it follows with a 



selective sweep between subpopulation X3 and £4 until time T 77,4 such that £~ 4 (x3 



Tv,- 



At this stage, £4(^2) starts to recover due to the lack of competition from ^(£3). Sim- 
ilarly as in the derivation of (5.38), the time needed for ^(^2) to again reach the 77-level 
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(denoted by T^ 2,2 ) can be computed explicitly 



rpr),2,Z r PVl' 



12,3 



f4M x 2) -d(x 2 )) 



In 



So In 



which, due to T 77 ' 4 ~ /4I11-, implies 



1 



T^ 2 ~(J 4 + ^ 2 )ln-. 



(5.42) 



(5.43) 



After that, it will approach its equilibrium £(£2) as a solution of a logistic equation. 
Consequently, Ct( x i) wm drift to due to competition from its fitter neighbor-trait x 2 . 
Recall from (5.34) that 

-/o,i 1 L 1 - l 

/4,3- 



rpr),0 _ rp 



r)A 



which implies 



f2,i(b(x ) - d(x )) h.2 
:(5o + / 2 -/ 4 )ln- 



T^ ~ (J 2 + S ) In -. 

e 



In 



(5.44) 



(5.45) 



Combining the above two estimates (5.42) and (5.44) with assumption (5.40), one observes 
that 

rpr),2,2 T~"),4 ^ T~"?,0 r pf]A 

Moreover, we have 



(5.46) 



7^7,2,2 
lim 7- 

r,->-0 ln- 



1 1 1 

/l,0 72,1 f3,2 

h + S 2 , 



+ 



I/: 



2,3| 



h,3 f4,3(b(x 2 ) - d(x 2 )) 



and 



.0 



lim — r 

r/^0 In - 



On- 



(5.47) 



(5.48) 



We thus obtain the explicit form of (3.6) for L = 4 



U, 



i(x )S X0 
i(xi)5 Xl 

i(x 2 )s X2 
i(x 3 )5 X3 
i(x 4 )s Xi 

£(x 2 )5 X2 + £(x 4 )5 X4 



for < t < h, 

for h <t < I 2 , 

for I 2 <t< I 3 , 

for h <t < J 4 , 

for J 4 < t < h + S 2 , 

for I A + S 2 <t<I 2 + S , 

for t> I 2 + S 



(5.49) 



, f (zo)$e + ^(x 2 )S X2 + £(x 4 )5 X4 

which is constructed on top of U t for L = 2 (see (5.2)) by partitioning the interval 
(I 2 , I 2 + Sq] into intervals (J 2 , J 3 ] U (I 3 , J 4 ] U (J 4 , I4 + $2] U (J 4 + 5 2 , h + S }. 

To mimic a similar procedure, a TST process for L = 6 can be obtained by connecting 
the TST process for L = 4 with a sub-TST consisting of traits {x 4 , 25, xq} specified as in 
Proposition |5.2 

□ 



Recursively, for all even integers L, the form of (3.6) follows. 



TRAIT SUBSTITUTION TREE 22 



5.2. Proof of Theorem 4.3 To prove the Theorem L3 we proceeds by listing two 
key lemmas. In the first one, we conclude that the occurrence time of each successive 
mutation is asymptotically characterized by exponential distribution while rescaling time 
in an appropriate way. In the second lemma, we justify that a one-step transition from 
a current configuration to a new one is in probability one as the migration rate e tends 



to 0. To avoid repeating arguments, we here only prove the first case of Definition 4.1 
while the second one follows a same fashion. The proof is carried out by the method of 
mathematical induction. 

For any non-negative integer I, denote by T^ the atomic measure with finite support, 
i.e., r( 2 ') = £U£(4?V X («). Similarly, set r^ +1 ) = ESl^)^), whose form 

is described as in Definition 
proved as in Proposition 5.2 



4.1 As for the transition from T^ ^ to T^ 1 ', it is trivial to be 
(a). To the end, it remains to show that the transition rule 
also holds from configuration r*- 2 '-* to T^ 2l+1 ' for any I E N. Denote by P r <20 the law of 
the process Xf' ff with initial configuration I^ 2 ". Denote by r € the first time after when 
there occurs a new mutation event. 

Lemma 5.5. Admit the same conditions as in Theorem \4-3[ 

, i 

limP r(20 (r e > -) = exp ( - ^£(4?)M4?))- (5-50) 

This lemma can be proved in a very similar way as the one for [31 Lemma 2 (c)]. We 
will not repeat the details here. 

Lemma 5.6. Assume Xq ,C7 = T^ 21 ' +p5 pi) h . Then, for any e > 0, there exists a constant 
C > such that 

limP(r e >ln-, sup \\X^ a - r (2m) || < e) = 1 (5.51) 



e->0 v e 



te(Clni,T E ) 



where r^ 2 ' +1 ^ is defined as in Definition 4-1 (i), and \\ ■ \\ is the total variation norm 



Proof. From Lemma 5.5 and - ^> In -, one concludes that, for any C > 0, 



limPfV >Cln-) = 1. 



According to assumption (Dl), there will be one and only one ranked position for the 
new trait x 2k + h among the supporting traits {x , . . . , x 2 j , . . . , x 2l } of r^ 2 ^. We will 

(2/) 

classify two cases according to whether x 2k + h falls on right or left hand side of some 

4f for < j < I. 

If there exists some x 2j (0 < j < I) such that x 2k + h falls on the right of x 2 a , one 
has the local fitness order 

. . . -< x 2 j_ 1 -*< x 2 j -< x 2k + ti -< x 2 j +1 -<.... (5.52) 

Since it is unpopulated for both trait sites x 2 ^_\ and x 2 j +1 at state T^ 2 ^, we consider 
(x 2 j , x 2k + h) as an isolated two-type system because of the assumption of nearest- 



neighbor competition. By analysis in Lemma 5.1 , the population density of this two-type 
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system converges to (0, £,(x 2k + h)) in time of order O (in i) as e — > 0. On the right hand 
side of this pair (x 2 j , x 2k + h), the configuration X^=j+i €( x 2i J^P") keeps stable as at 
previous state T^ 2l \ Whereas on the left hand side of ccL , population density of trait 
x 2 j_i, due to the decay of its competitive fitter neighbor trait x 2 j , recovers exponentially 
fast upto the stable equilibrium £(a?<y-i) °f the following logistic equation 

4 e (4f i) = (K4f i) - rf(4f i) - «(4f i, 4f i)a(4fi))^(4f i)- (5.53) 

Continuing in the same way, the mass occupation flips on the left hand side of the trait 
site x 2j such that traits {x 2i _ ± , 1 < i < j} get re-established while {x 2i , < i < j} are 



eliminated. By similar arguments as in the finite trait case (see Section 5.1), the entire 
rearrangement process can be completed in time of order O(ln-). We obtain the new 
equilibrium configuration 

j _ _ i 

r (2i + i) = Y,ax%l 1 )6 x m i +Z(x 2 f + h)6 x va )+h + J2 £(4?)i( f . (5-54) 

In the other case, the fitness location of trait x 2k + h falls on the left hand side of some 
x 2 j for < j < I, that is, 

(21) (21) , (21) (20 

x 2j-l ^ x 2fe T '* -=> X 2 j -S ^2j'+l- 

Similarly, consider the sub-populations (a^ +/i, #2j ) as an isolated dimorphic system as 
in Lemma 5.1 Since p — > as e — > 0, we obtain that (^(ajL. + h),^ t (x 2l • ) ) , starting 

with (p, ^(^ 2 j )); converges to ( 0, ^(x 2j - ) j . Consequently, due to a lack of competition 
from its nearest fitter neighbor x 2k + h, sub-population zL-i starts to recover, so does 

(21) 

x 2i-i f° r ever Y 1 < i < j- Thus, we obtain the new equilibrium configuration 

^— • ,i 2i-l *— • x 2i 

t=l i=j 

In conclusion, the new configuration T^ 2l+1 ^ is obtained by relabeling the traits as done in 



Definition 4. 1|(Y). □ 

Lemma . 



5.5 



shows that the mutation occurs on the time scale O(-). Recall from Section 
p that the time scale for fixation is O (in -) on a finite trait space. Combining them with 
the time scale separation constraint - 3> In - (heuristically introduced by Metz et al 
[T7] and mathematically developed by Champagnat [3]), the proof of Theorem 4.3 is as 
follows. 

Proof of Theorem \4-3\ For any non-negative LeN, let B^ L > be a measurable subset of 
X L , such that B^ := {x^ = (x { L) , . . . , xf ] ) : xJ L) -< . . . -< xf ] } as in Assumption (Dl). 
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Define, for any L = 21 even, 

and for L = 11 + 1 odd, 

n{x\ ') 



^{x\ ) for i even integer 
for i odd integer, 



for i even integer 

£(x\ ) for i odd integer. 



(5.55) 



(5.56) 



Then the support process of (T t )t>o i n Definition 4.1 denoted by (Z t )t>o, has the following 
infinitesimal generator 



Gtp{yF>) = / (<^(x (L+1) ) - v?(x (L) )) 0{x^)k(x^ l \ dh), 



(5.57) 



where f3{x yL ') = ^ n(x\ )fJ>(x\ ), and the probability kernel 



MX 



M 



^) = E 



„( L )^,,^w 



M4V(4_) ,il) 



k=0 



/3( x w; 



p(4 >°^), 



(5.58) 



{L+l) 

1 5 



, x L+1 ) is determined by x^ L ^ and p(x k , d/i) as in Definition 



4.1 



and x (L+1) = 

According to Definition 4.1 , let P^l) be the law of (Z t ) t > with initial state x^ L ^. Denote 
by (S n )n>i the sequence of occurrence times of mutations. By applying the strong Markov 
property at Si, we obtain 

P x(L) (S n <t<S n+1 ,Z t eB^ L+ ^) 
= f (3(x {L) ) exp- s/3 ( x(L) ) f P x(L+1) (S n ^ <t-s<S n , Z t _ s e fi( L+ ™)) k(x^ l \ dh)ds. 

Jo JheR d 



In particular, 



P X W (0 < t < S h Z t e £? (L) ) = l {x( L )eiJ (L) } exp (-t/3(x( L ))) 



(5.59) 



(5.60) 



The idea of our proof for the theorem is to show that the same relation as above holds 
for the rescaled processes (X^f) t >o as taking e — > when we replace Z t by Supp(X^f) 
and replace S n by the n-th jump time r^ of (X^f) t >o- 

Let P x (i) be the law of (X*f)t>o with initial state's support x^ L \ Consider the quantity 

P x „) (< < t < < +1 , Supp(X;p G B^) . (5.61) 

For n = 0, it is implied from Lemma [5. 5| that 

limP x(i) (o < t < rl Supp(X e t p E £? (L) ) = l {l ( i)£B(i ) } exp (-t/3(x^)) . (5.62) 
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For n > 1, by the strong Markov property at t(, we obtain 
F x(L) (< < t < < +1 , Supp(*tP e 5(L+n) ) 



2.". 



P x(i) (r* < t) / P x (l +1) ( <_ x < t - 7? < <, Supp(X e t >° ) g £ (L+n) ) K e (xM, dh) 
Jhm« \ -^ j 



where n e (x^ L \ dh) = Yl 



* ?; f (4 L) )M(4 L) M4 L) .*) , A n(xPMxP) P (x<f\dh) 



fc=0 i 



converges to ^2 

k=0 £ n(x[ L) )^xi L) ) 

k=0 



(5.63) 
as e — > 



due to Theorem 3.1 and the form of n(-) given in Eq.(5.55) and (5.56). 

Substituting the terms on the RHS of Eq.(5.63) by their limits when taking e — > 0, 
combining with (5.62) and (5.58), we obtain 



limP x(i) (< < t < < +1 , Supp{XtP e fi(L+n) 

= lim T /J(x< L >) exp (-^(x^)) /" V ^' ) ^ )) ^' &) 



L 

E 

fc=0 



fc=0 E.fit^rvt^') 



/3(x( L ))exp(-s/3(xW)) 
/ limP x(i+1) ( <_ x < t - s < <, Supp(X£! s ) G £ (L+n) J /t(x (L) , d/i)ds. 

Jh&Rd^O \ — J 



(5.64) 



By (5.60) and (5.62), we conclude that 



limP x(i) (0 < t < rl Supp(X e t p G £?^) = P xW (0 < t < S h Z t G B™) , (5.65) 
and moreover, by (5.59) and ( |5.64[ ), we conclude that 

limP x(i) (< < t < <+!, Supp(X e t p G £( L +")) = P x(i) (5 n < t < S n+1 , Z t e B^) . 

(5.66) 
Thus, by conditional probability and Lemma 5.6 for any e > 

limP x(L) (< < t < < +1 , 5upp(^) G s^+"), ||xj- - r^|| < e ) 

= limP x(i) (< < t < < +1 , fifcpp(X t %) G B(^)) P x( , +n) (\\Xfc - T^\\ < e) 

= P x(L) (S n <t<S n+1 ,Z t eB^ L+ ^). 

(5.67) 
Therefore, Theorem 4.3 is proved in the sense of one-dimensional time marginal distri- 
bution. It can be generalized to convergence in finite dimensional distribution similarly 
as in [3]. □ 
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6. Simulation algorithm 



The pathwise construction of the TST process defined in Definition |4.1| leads to the 
following numerical algorithm for simulation of the TST process. 



Step 0. Specify the initial condition: T = T^ ) = £,(x )5 X0 . 

Step 1. Simulate T\ exponential distributed with parameter £(x )/i(x ). Sample a new trait 

(xo+h) with density p(x , dh). If f(x +h, Xo) > 0, relabel x := x , x\ := x +h. 

Otherwise, relabel x := x + h, x\ := x . 

Set VW = ^(x?)5 (i), and T t = T<® for t e [0,n). 

x 1 

Step 2. Simulate r 2 exponential distributed with parameter £(x\ )fi(x\ ). 
Setr^r^ 1 ) fortG [r^n + rs). 

Sample a new trait (x| + ti\ with density p(x\ , dh). 
Choose one from the following to carry out: 

— if f(x\ + h,x[ ) > 0, relabel x := x , x\ := x\ , x 2 := x[ + h; 

— if f(x? + h,x^ ] ) < 0, /(x^ + h,x { o } ) > 0, relabel xf := xf, xf := 

rr> V ' I y-\ rr> > ' ry* * / • 

Jb 1 \ I I* Jury . Jb 1 . 

— if f{x\ + h,x ) < 0, relabel x .= x[ + h, x[ .= x , x 2 '■— x[ . 



Setr( 2 ) = e(4 2) )^)+e(4 2) )i«- 



"2 

Step 21+1. Generate t( 2 ' +1 ) from T™ = £ £(xj>f)<5 eo for I = 1,2 



5 -"5 



i=0 



,(2I)n / (2Q> 



Simulate r 2 ; + i exponential distributed with parameter £ £(4i VO^i ;• 

i=0 

21 2l+1 

Set r t = T^ for t G [ X] r «; X] r *)- Select one trait x 2k , for any < fe < /, 

ffx' 2 '') u(x' 2! ') / (21) \ 

to mutate with probability — ; - fc (20 2k (20 ■ Sample a new trait (x!^. + ^) with 

2~Jj=0 5(^21 )v( x 2i ) 

probability density p(x!, fc , <i/i). Choose one from the following three cases to carry 
out: 

- if f(x ( 2 f+h, 4?) > 0, relabel xgjj 1} := x£°+/i, xf +1) := xf ° for < i < 21; 

- if f(x ( 2 f + h,xf ] ) < 0, relabel xf +1) := xf j for 1 < i < 21 + 1, and 

(21+1) ._ (21) , . 

x - — x 2fc "r "') 

- otherwise, there exists < j < I s.t. f{x 2k + h,x 2i ) < for j < i < I, and 
f(x 2k + h,x 2 j ) > 0. Furthermore, 

* if f{x ( 2 f + h, x { 2 f +1 ) < 0, relabel xf +1) := xf ° for < i < 2j , xf l+1) := 
xfl\ for 2j + 2 < i < 21 + 1, and xg+i := 4f + ^; 

* if /(4f + /i,4?+i) > 0, relabel xf l+l) := xf l) for < i < 2j + 1 , 



,(2J+1) ._ (21) f _ , , q ^ , ^ 07 , ! _^ „(2l+l) ._ J21) 
~ x i-l 

,(2i+l)> 



.r, ' := xH for 2j + 3 < i < 2/ + 1, and xg'^^ := x^. J + h. 
1+1 



Set r( 2 ^) = £ e(4ti V 3 



8=1 



(81+1). 
"2i-l 
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l+l 



Step 21+2. To generate T( 2 '+ 2 ) from T( 2 ' +1 ) = Y] £(4l-i M ( m+d for Z = 1, 2, • • • . 

This can be done as similar as the induction from r' 2 " to r*- 2 '"*" 1 ^. So forth. 



Appendix A. Stability of a Lotka-Volterra system 
Consider a Lotka-Volterra system (n(x),n(y)) satisfying the following equations. 

n t (x) = (b(x) - d(x) - a(x, x)n t (x) - a(x, y)n t (y)) n t (x) ,^ ^ 

ht(y) = (b(y)-d(y)-a(y,x)n t (x)-a(y,y)n t (y))n t (y). 

Suppose that n Q (x),n (y) > and f(y,x) := b(y) — d(y) — a(y,x)n(x) > 0, n(x) = 
a(x x) ' an< ^ ^ S symmetric form f(x,y) < 0. Then we conclude that (0,n(y)) is the only 
stable point. 

In fact, there are four fixed points of above system, namely, (0, 0), (n(x), 0), (0, n(y)), 
and (n*(x),n*(y)), where (n* (x) , n* (y)) is such that 

b(x) — d(x) — a(x,x)n t (x) — a(x,y)n t (y) = 
Kv) - d (v) - a (yi x)n t {x) - a(y, y)n t (y) = 0. 
By simple calculation, we obtain that 

n *( x ) = <*(y,y)f(x,y) 

\ / a(x,x)a(y,y)— a(x,y)a(y,x) 

n *( x ) = a(x,x)f(y,x) 

\ / a(x,x)a(y,y)—a(x,y)a(y,x) ' 

To make sense of the solution as a population density (which must be non-negative), one 
needs f(x,y) ■ f(y,x) > 0. It contradicts the assumption f(x,y) < 0,f(y,x) > 0. We 
thus exclude the solution (n*(x),n*(y)). 

The Jacobian matrix for the system (A.l[) at point (0, 0) is 



b(x) — d(x) 

b(y) - d(y) 

Obviously its eigenvalues are both positive. Thus (0, 0) is unstable. 
The Jacobian matrix at point (n(x),0) is 

(b(x) — d(x)) —a(x,y)n(x) 

b(y)-d(y)-a(y,x)n(x) 

- (b(x) — d(x)) —a(x, y)n{x) 
o f(y,x) 

Since one of its eigenvalue — (b(x) — d(x)) is negative whereas the other one is f(y, x) > 0, 
the equilibrium (n(x),0) is unstable. 



The Jacobian matrix of system (A.l) at point (0,n(y)) is 



b{x)-d{x)-a(x,y)n{y) \=( f( x 'V) ° 

-a(y,x)n{y) - (b(y) - d{y)) ) \ -a{y,x)n{y) - (b(y) - d(y)) 

whose eigenvalues are both negative because of the condition f(x,y) < 0. Thus (0, n(y)) 



is the only stable equilibrium of the system (A.l). 
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